Distributed Large-Scale Information Filtering

Authors

  • Christos Tryfonopoulos
  • Stratos Idreos
  • Manolis Koubarakis
  • Paraskevi Raftopoulou
Abstract

We study the problem of distributed resource sharing in peer-to-peer networks and focus on the problem of information filtering. In our setting, subscriptions and publications are specified using an expressive attribute-value representation that supports both the Boolean and Vector Space models. We use an extension of the distributed hash table Chord to organise the nodes and store user subscriptions, and utilise efficient publication protocols that keep the network traffic and latency low at filtering time. To verify our approach, we evaluate the proposed protocols experimentally using thousands of nodes, millions of user subscriptions, and two different real-life corpora. We also study three important facets of the load-balancing problem in such a scenario and present a novel algorithm that manages to distribute the load evenly among the nodes. Our results show that the designed protocols are scalable and efficient: they achieve expressive information filtering functionality with low message traffic and latency.
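To make the general idea concrete, here is a minimal sketch of storing subscriptions on a hash ring and matching a publication against them. It is not the paper's protocol: it assumes a tiny in-memory ring with no finger tables or network messages, supports only Boolean conjunctions of words per attribute (no Vector Space similarity), and indexes each subscription under a single term. All names (`Ring`, `ring_id`, `subscribe`, `publish`) are hypothetical.

```python
import hashlib
from bisect import bisect_left

RING_BITS = 16  # assumed small identifier space, for illustration only


def ring_id(key: str) -> int:
    """Hash a string onto the identifier ring [0, 2**RING_BITS)."""
    digest = hashlib.sha1(key.encode("utf-8")).digest()
    return int.from_bytes(digest, "big") % (2 ** RING_BITS)


class Ring:
    """In-memory stand-in for a Chord-like ring (no routing, no churn)."""

    def __init__(self, node_names):
        self.node_ids = sorted({ring_id(name) for name in node_names})
        # node id -> list of (subscriber, attribute, required terms)
        self.store = {nid: [] for nid in self.node_ids}

    def successor(self, key_id: int) -> int:
        """First node id at or after key_id, wrapping around the ring."""
        i = bisect_left(self.node_ids, key_id)
        return self.node_ids[i % len(self.node_ids)]

    def subscribe(self, subscriber, attribute, terms):
        """Store a conjunctive subscription at the node owning one of its terms.

        Assumes `terms` is non-empty. The smallest term is chosen as the
        index key so the choice is deterministic.
        """
        terms = frozenset(t.lower() for t in terms)
        home = self.successor(ring_id(attribute + ":" + min(terms)))
        self.store[home].append((subscriber, attribute, terms))
        return home

    def publish(self, publication):
        """Return the subscribers whose subscriptions match the publication.

        `publication` maps attribute names to text. Every distinct word is
        hashed, so the node holding any matching subscription is visited.
        """
        matched = set()
        for attribute, text in publication.items():
            words = set(text.lower().split())
            owners = {self.successor(ring_id(attribute + ":" + w)) for w in words}
            for nid in owners:
                for subscriber, attr, terms in self.store[nid]:
                    if attr == attribute and terms <= words:
                        matched.add(subscriber)
        return matched


ring = Ring([f"node{i}" for i in range(8)])
ring.subscribe("alice", "title", ["distributed", "filtering"])
ring.subscribe("bob", "abstract", ["chord"])
print(ring.publish({"title": "Distributed Large-Scale Information Filtering"}))
```

Indexing a subscription under only one of its terms keeps storage low, at the price of contacting one node per distinct word at publication time; this trade-off is a choice made for the sketch, not a claim about the protocols evaluated in the paper.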


Similar resources

PERFORMANCE EVALUATION OF ROUTE-BASED DISTRIBUTED PACKET FILTERING FOR DDOS PREVENTION IN LARGE-SCALE NETWORKS A Thesis

Kim, HyoJeong. M.S., Purdue University, December, 2003. Performance Evaluation of Route-based Distributed Packet Filtering for DDoS Prevention in Large-scale Networks. Major Professor: Kihong Park. This thesis studies performance evaluation of route-based distributed packet filtering (DPF) for spoofed distributed denial of service (DDoS) attack prevention in large-scale networks under dynamic n...


Distributed piecewise filtering design for large-scale networked nonlinear systems

This paper investigates the problem of distributed piecewise H∞ filtering for discrete-time large-scale nonlinear systems. The considered large-scale system is composed of a number of nonlinear subsystems and exchanges its information through a communication network. Each nonlinear subsystem is described by a Takagi-Sugeno (T-S) model, and data-packet dropouts happen intermittently in communicatio...


Distributed multi-agent Load Frequency Control for a Large-scale Power System Optimized by Grey Wolf Optimizer

This paper aims to design an optimal distributed multi-agent controller for load frequency control and optimal power flow purposes. The controller parameters are optimized using Grey Wolf Optimization (GWO) algorithm. The designed optimal distributed controller is employed for load frequency control in the IEEE 30-bus test system with six generators. The controller of each generator is consider...


Hierarchical Filtering-based Monitoring System for Large-scale Distributed Applications

On-line monitoring of large-scale distributed (LSD) applications is an effective means to observe the applications' behavior at run-time and provide status information required by debugging and management tools. In this paper, we describe and motivate the architecture and the components design of a scalable, high-performance, dynamic and non-intrusive monitoring system for LSD applications. The...


Using WordNet as a Knowledge Base for Measuring Semantic Similarity between Words

In this paper we propose the use of WordNet as a knowledge base in an information retrieval task. The application areas range from information filtering and document retrieval to multimedia retrieval and data sharing in large scale distributed database systems. The WordNet derived knowledge base makes semantic knowledge available which can be used in overcoming many problems associated with the...



Journal title:
  • Trans. Large-Scale Data- and Knowledge-Centered Systems

Volume 13

Pages -

Publication date: 2014